SCALABLE MONITORING METHOD AND DEVICE 


The present invention relates to a method for the scalable monitoring of a running 
computer system. The computer system is constituted by a set of computer equipment units 
called distributed resources, each being interconnected to the others through a communication 
network. 

Large companies have an increasing number of units of computer equipment to 
manage. These units, connected to one another through a communication network called a 
local area network (LAN), are managed by a manager. In order to manage (supervise, act on, 
monitor, control) these units remotely from one point, an architectural model comprising a 
manager and an agent, for example of the SNMP type, is most commonly adopted. In this 
architectural model, the (SNMP) agents implemented in the equipment units of the network 
mtorm the manager of the status of each of the units managed. In the vast majority of cases, 
this manager manages several thousand units of distributed equipment in one or several 
countries. 

In order to characterize the behavior of a set of resources interconnected through a 
network, any monitoring method synthesizes, and therefore calculates values known as 
mdicators, which make it possible to characterize the operation and the status of these 
resources. The calculation of an indicator is performed by an agent called an "indicator 
agent," which periodically calculates a value resulting from a function that combines values 
measured by means of an SNMP or CMIP protocol layer with values calculated by other 
indicator agents. 

Traditionally, this monitoring is performed by the manager, which centralizes the 
acquisition of the measurements and the calculation of the indicators. The information 
exchanged between the manager and the resources flows through a wide area network, also 
called a "WAN." However, the cost of bandwidth in the WAN is not adapted to monitoring. 
Likewise, the exchanges of information involved in centralized monitoring generate heavy 
usage of the WAN. This problem is explained by the fact that the bandwidth of the WAN is 
too small for the ever-increasing amount of information that must pass between the managers 
and their equipment. Therefore, the object of the present invention is to eliminate the 
drawbacks of the prior art by offering a scalable monitoring method that makes it easy to 
reorganize the architecture of the computer system when it changes. 


This object is achieved through the fact that the method for the scalable monitoring of 
a computer system comprising a plurality of computer equipment units constituting hardware 
resources to be monitored forming a monitored domain, the method being implemented by 
means of a central computer system called a manager connected to a communication network 
5 that allows the transfer of information between at least one resource and the manager, is 
characterized in that it comprises: 

- a step for breaking the monitored domain down into monitored subdomains 
comprising a predetermined maximum number of resources, 

- a step for automatically creating and configuring, for each subdomain, an 

10 information synthesis node comprising at least one synthesis agent stored in the storage 
means of a resource; each synthesis agent is designed to synthesize indicator values 
calculated and stored in the storage means of at least one resource, these indicators 
representing an operational status of the resources of the subdomain and being evaluated by 
indicator agents installed in these resources, each indicator agent being uniquely identified by 

f 5 the name of the indicator it calculates and by the subdomain in which it is installed and being 
associated with each synthesis agent using the corresponding indicator value, 

- a step for modifying the associations between the synthesis agents and the indicator 
agents when the predetermined maximum number of resources in a subdomain is reached, in 
order to accommodate the addition or deletion of indicators so that the new architecture of the 

20 monitored domain comprises, in each subdomain, a number of resources lower than the 
== predetermined maximum number of resources. 

According to another characteristic, the step for configuring a synthesis node 
comprises, for each synthesis agent: 

- a step for searching, in a table stored in the storage means of a resource, for the 

25 name of the indicator agent or agents required to calculate the indicator of the synthesis agent 

- a step for the subscribing the synthesis agent to the indicator agents found during the 
search step, this subscription step allowing each synthesis agent to automatically receive, in 
its subscription table stored in the storage means of a resource, the new values of the 
indicators found. 

30 According to another characteristic, the search step comprises: 

- a step for the sending by the synthesis agent of a notification to a naming service 
dedicated to storing the associations between a subdomain name, an indicator agent and an 
indicator, this notification comprising the name of a given subdomain and a given indicator. 
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According to another characteristic, the modification step comprises: 

- a step for installing at least one indicator agent in each new resource added to a 
subdomain, 

- a step for sending, to the synthesis agents requiring the value of the indicator of the 
5 new indicator agent or agents, a notification comprising the identification of the new 

indicator agent or agents, 

- a step for subscribing each synthesis agent to the new indicator agents required to 
calculate the indicator of the synthesis agent. 

According to another characteristic, the modification step comprises: 
10 - a step for selecting, for each subdomain, the resources to be deleted 

- a step for sending, to the synthesis agents using the value of the indicator of the 
indicator agent or agents installed in the selected resource or resources, a notification 
comprising the identification of the deleted indicator agent or agents 

- a step for unsubscribing the synthesis agents from the indicator agents whose 
:l 5 indications are contained in the notification. 

According to another characteristic, the maximum number of resources per 
subdomain is determined either so that the cost of calculating the indicators is as low as 
possible, or so that the number of synthesis nodes is as low as possible. 

A second object of the invention consists of offering a scalable monitoring device that 

20 makes it easy to reorganize the architecture of the monitoring system when the size of the 
monitored domain, i.e. the number of elements to be monitored, changes. 

This object is achieved through the fact that the device for the scalable monitoring of 
a computer system comprising a plurality of computer equipment units constituting hardware 
resources to be monitored forming a monitored domain is characterized in that it comprises 

25 mans for breaking the monitored domain down into monitored subdomains comprising a 
predetermined maximum number of resources, means for creating and configuring, in the 
storage means of a resource, information synthesis nodes comprising at least one synthesis 
agent stored in the storage means of at least one resource and designed to synthesize indicator 
values calculated and stored in the storage means of a resource, these indicators representing 

30 an operational status of the resources of the subdomain and being evaluated by indicator 

agents installed in these resources, each indicator agent being uniquely identified by the name 
of the indicator it calculates and by the subdomain in which it is installed, the configuration 
of a synthesis agent comprising the storage, in the storage means of a resource, of the 


associations between the synthesis agent and indicator agents, means for modifying the 
associations between the synthesis agents and the indicator agents when the predetermined 
maximum number of resources in a subdomain is reached, so that the new architecture of the 
monitored domain comprises, in each subdomain, a number of resources lower than the 
5 predetermined maximum number of resources. 

According to another characteristic, the means for configuring a synthesis node 
comprise means for searching, in a table stored in the storage means of a resource, for the 
name of the indicator agent or agents required to calculate the indicator of the synthesis 
agent, and means for subscribing the synthesis agent to the indicator agents found during the 

] 0 search step, these subscription means allowing each synthesis agent to automatically receive, 
in its subscription table stored in the storage means, the new values of the indicators found. 

According to another characteristic, the search means comprise means for the sending 
by the synthesis agent of a notification to a naming service dedicated to storing, in a table 
stored in the storage means of a resource, the associations between subdomain name, an 

IS indicator agent and an indicator, this notification comprising the name of a given subdomain 
and a given indicator, and means for the sending by the naming service of a notification to the 
requesting synthesis agent, comprising the name of the indicator agent or agents 
corresponding to the association of the given subdomain and the given indicator. 

According to another characteristic, the modification means comprise means for 

20 creating and storing at least one indicator agent in each new resource added to a subdomain, 
means for sending, to the synthesis agents requiring the value of the indicator of the new 
indicator agent or agents, a notification comprising the identification of the new indicator 
agents or agents, means for subscribing each synthesis agent to the new indicator agents 
required to calculate the indicator of the synthesis agent. 

25 According to another characteristic, the modification means comprise means for 

selecting, for each subdomain of the resources to be deleted, means for sending, to the 
synthesis agents using the value of the indicator of the indicator agent or agents installed in 
the selected resource or resources, a notification comprising the identification of the deleted 
indicator agent or agents, and means for unsubscribing the synthesis agents from the indicator 

30 agents whose identifications are contained in the notification. 

According to another characteristic, the maximum number of resources per 
subdomain is determined either so that the cost of calculating the indicators is as low as 
possible, or so that the number of synthesis nodes is as low as possible. 


Other characteristics and advantages of the present invention will emerge more 
clearly through the reading of the following description, given in reference to the attached 
drawings, in which: 

- Fig. 1 represents a simplified schema of a computer system in which the method 
5 according to the invention can be applied; 

- Fig. 2 represents an exemplary architecture of the links between monitoring agents 
according to the method of the invention 

- Fig. 3 represents the process for deploying a monitoring method. 

As explained above, a computer system (1) comprises at least one local area network 
10 ( 10, 20) that communicates with a central system (2) or manager through a wide area network 
(3). Each local area network (10, 20) comprises at least one unit of computer equipment (101, 
102, 201, 202) called a resource. 

In the prior art, the monitoring of all the resources (101, 102, 201, 202) is handled by 
means of remote agents (SNMP or CMIP agents, depending on the management protocol in 
J 5 question) installed in each resource ( 1 01 , 1 02, 20 1 , 202) . These agents allow a manager to 
collect information on the resources (101, 102, 201, 202) by measuring given parameters, for 
example by sending an SNMP request GetRequest to the SNMP agent in question, then to 
transmit the collected information to the manager (2) by means of a request, for example 
GetResponse in the SNMP world. The manager (2) then evaluates the indicators from this 
2-Q received information. It is understood that all the requests sent by the manager (2) in order to 
gather monitoring information and the responses to these requests from the agents produces 
considerable congestion in the wide area network (3), consequently diminishing its 
performance. 

According to the invention, all or part of the evaluation of the indicators is distributed 

25 at the local area network level (10, 20) to indicator agents. 

To do this, for each indicator (In) characterizing a particular operation or status of a 
resource (101, 102, 201, 202), a specific indicator agent (Al, A2, Bl, B2) is installed in the 
resource (101, 102, 201, 202). In the exemplary embodiment represented in Fig. 1, each agent 
(Al, A2, Bl, B2) is responsible for evaluating a different indicator (!„). In other words, each 

30 indicator (!„) makes it possible to determine a different operation or a different status in each 
resource (101, 102, 201, 202). However, each agent (Al, A2, Bl, B2) described in reference 
to Fig. 1 can be present, for example, in all of the resources (101, 102, 201, 202) or all the 
resources of the same type in the computer system (1). 
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For example, a first agent (Al) can evaluate an indicator (Iai) whose formula makes it 
possible to know, for example, the number of alarms per minute emitted by a first resource 
(101). It is understood that this agent (Al) can be installed in all the resources (101, 102, 201, 
202) of a computer system (1). 

The indicator agents (Al, A2, Bl, B2) are elementary programming and execution 
entities. The indicator agents (Al, A2, Bl, B2) are autonomous in order to give the structure 
the property of scalability by making it easy to add or delete one or several indicator agents in 
the architecture. They are made autonomous by correctly defining the interface for these 
agents. They are also autonomous in terms of communication. They communicate using 
notifications and not procedure calls, which makes it possible to open up the architecture. 
This communication is asynchronous, thereby making it possible to obtain a better 
parallelization of the indicator agents and hence a better scalability of the architecture. 

The indicator agents (Al, A2, Bl, B2) can be persistent objects, and their status is 
derived from data that, in this case, can exist outside any execution structure and can be 
stored in persistent E^PROM or hard disk memories. An indicator agent (Al, A2, Bl, B2) is 
an independent machine that reacts to notifications. 

Each agent (Al, A2, Bl, B2) is an object of a class and is identified by a respective 
identifier id(Ai), id(A2), id(Bi), id(B2) that is unique in the computer system (1) so that 
communications between indicator agents are possible, and that makes it possible to locate 
them. An agent can be created in a remote server. The creation entity is responsible for 
creating the identifier. The structure of the identifier must take into account the static location 
of the agents in order to allow the system to forward the notifications, and to allow local 
"production" of the identifiers for the remote agents. The identifier of an agent comprises the 
following three parts: 

- the identification (idsac) of the agent server hosting the creation agent, 

- the identification (idsal) of the agent server hosting the agent created, and 

- a postmark (el), local to the agent server hosting the creation agent. 

A "factory" agent is present in all of the agent servers and is responsible for creating 
objects in the server It makes it possible to manage the creation of remote agents. 

According to the invention, each indicator agent (Al, A2, Bl, B2) is responsible for 
evaluating one indicator (!„). In order to do this, it must comprise properties that make it 
possible to collect a measurement (Mn) on the resource (101, 102, 201, 202) with which it is 
associated, and communication properties in order to request or receive, as necessary, at least 


one other indicator evaluated by another indicator agent (Al, A2, Bl, B2) according to the 
invention, for example by means of notifications. These properties are obtained by means of 
specific program modules constituting each indicator agent (Al, A2, Bl, B2). Thus, an 
indicator agent (Al, A2, Bl, B2) comprises a collection module and/or a communication 
module. Likewise, an indicator agent (Al, A2, Bl, B2) also comprises an evaluation module 
for calculating the indicator itself from the formula, using the information collected by the 
other two modules. In this evaluation module, the equation that defines the value of an 
indicator (Im for example) can refer to a possibly empty set of other indicators (Ia2, Ibi, for 
example). 

The indicator agents according to the invention use a communication protocol that is 
preferably asynchronous, which means that when an indicator agent sends a request for 
collecting a measurement on a resource of the system or a notification to another indicator 
agent, it does not wait for the response to this request or notification before possibly sending 
a new request or notification. In other words, the sending of requests and notifications is 
performed in parallel. Likewise, the management protocol can be asynchronous. For 
example, the asynchronous management protocol used is the protocol SNMP or the protocol 
CMIP, and it makes it possible to parallelize the acquisition of the data and the calculation of 
the indicators for which the data is available. 

However, it is possible to use a synchronous protocol, but the performance in that 
case is inferior, particularly in terms of speed in evaluating the indicators. Moreover, it is 
appropriate to install an asynchronous device for collecting and storing measurements so that 
the indicator agent that sent the request need only consult the memory of this measurement 
collecting device. This device is, for example, a particular management information base 
(MIB), called an Rmon MIB by one skilled in the art. 

Each indicator agent manages a so-called subscriber list on which the names of other 
indicator agents according to the invention may be written. This list is stored in the storage 
means of the computer equipment unit associated with the indicator agent, for example in the 
form of a table (1010, 2010). An indicator agent Ai is written on this list by sending a 
specific so-cahed subscription notification "Subscribe (id(Ai) Management Information 
(A]))" to another indicator agent Bi, which calculates the indicator (Ibi). This notification 
includes as parameters a piece of so-called management information that allows the sending 
agent to create an association between a propagation of a value modification and the other 
indicator agent (Bj), and the identifier id(Ai). Upon receiving a subscription notification, the 


destination agent (Bl) processes the notification by writing into the subscriber table (2010) 
the identifier (Mai) of the sending agent as well as the management information 
(Management Information (A|)) on the list. This hst is consulted by the indicator agent (Bi) 
that manages it, after the evaluation of the indicator (Igi) by the agent (Bj). If the new value 
5 of the indicator is different from the value previously evaluated and stored by the agent, then 
the agent sends each agent written on the subscriber list (2010) and identified by the 
parameter (id(Ai)) a value change notification (ValueChanged) comprising the new value 
(Val(lBi)) of the indicator (Ibi). To do this, after the evaluation of the indicator (Ibi), a 
comparison module of the indicator agent (B]) compares the new value of the indicator 

10 (Val(lBi)) to the value previously calculated (Valp(lBi)) and stored. If the two values are 
different, the comparison module activates a procedure for sending the value change 
notification (ValueChanged) to all of the agents, for example A, written on the subscriber hst 
(2010) then records the new value of the indicator in the storage means of its computer 
equipment unit. The value change notification comprises as parameters the new value of the 

15 indicator (Val(lBi)) and the management information (Management Information (A,)) of the 
target agent, so that the target agent can assign the value received to the indicator awaited. 

According to the invention, the monitored domain constituted by the set of resources 
(101, 102, 201, 202) is organized into a plurality of subdomains (d]={Ai, A2}; d2={B], B2}). 
Each of the subdomains contains either a set of resources to be monitored, and/or a set of 

20 subdomains. 

A subdomain is also characterized by the indicator agents running in the resources 
constituting the subdomain. For each subdomain, a set of agents called "synthesis agents" is 
installed in a particular resource so as to constitute a synthesis node. This resource can be a 
dedicated monitoring resource or a general-purpose resource chosen because of its technical 
25 characteristics to support the synthesis node. Moreover, this resource could also be a given 
resource of the subdomain or of another subdomain. In fact, one resource can support the 
monitoring of several subdomains, for example ranging from 0 to n, if this resource is 
capable of handling the load produced by the calculations performed by a plurality of sets of 
synthesis agents. 

30 Basically, the structure of a synthesis agent is identical to that of an indicator agent; 

the difference in terms of behavior and utilization lies mainly in the formula representing the 
indicator to be evaluated. 


During the configuration of the monitoring of the monitored domain according to the 
invention, a naming service is used to define and maintain the associations between the name 
of the subdomain and the identification of the indicator agent responsible for evaluating an 
indicator of the subdomain. 
5 In other words, the naming service (SN) supplies, for each subdomain, all of the 

indicator agents present in this subdomain and associates each indicator agent (AO with the 
indicator (lai) it calculates and with the subdomain (dl) to which it belongs, for example in a 
table (4) or in an association file. This naming service SN is then used during the 
configuration of each synthesis agent. This naming service can be either centralized or 
10 distributed. When the naming service is centralized, as represented in Fig. 1, the information 
relative to the association between a subdomain and an indicator agent is centralized in a 
single place in the monitored domain represented, for example, by a resource of the 
monitored domain. 

When the naming service is distributed, the naming service associated with each agent 
15 machine has the names of the agents it manages, as well as the subdomain to which each 
agent belongs. 

The agent machine manages a certain number of properties, i.e. the atomic aspect of 
an agent's reaction to a notification and the delivery of the notifications in a causal sending 
order. 

20 The agent machine handles the distribution of the notifications, the transmission of 

the notifications and the management of the overall atomicity. The agent machine comprises 
various entities such as an execution engine and a communication channel, this channel 
having two queues, a local queue and an external queue. 

The agent machine is distributed. The agent machines communicate with each other; 

25 the agents do not see this distribution since they communicate through notifications and 
address their local agent machine. An agent's reaction is always local. 

The centralized or distributed naming service SN is necessary for the configuration of 
the synthesis agents (As). In essence, as a general rule, a synthesis agent is designed to 
evaluate an indicator whose formula includes at least one set-oriented operator such as a sum, 

30 an average, or the determination of a minimum or a maximum in one or more indicators of 
the subdomain with which the synthesis agent is associated. However, a synthesis agent can 
also evaluate an indicator representing the general operation of the subdomain with which it 
is associated. 


-9- 


Thus, the configuration of a synthesis agent requires the latter to subscribe to all the 
indicator agents of the subdomain monitored by the synthesis agent that evaluate the indicator 
or indicators contained in the formula of the indicator of the synthesis agent. To do this, the 
synthesis agent (As) sends the naming service (SN), for each indicator required for the 
5 evaluation of its own indicator, a notification requesting an indicator lookup 

(indicatorLookupReq) that specifies the name of the subdomain (dl) that the synthesis agent 
(As) monitors, as well as the names of the indicators required for the evaluation of its own 
indicator. In response to this notification, the naming service performs a search in order to 
find the names of all the agents that perform the evaluation of the requested indicator in the 

10 subdomain specified in the notification. The naming service constructs and then sends a 

response notification (indicatorLookupResp) to the requesting synthesis agent containing the 

_^ name of the domain (dl) and the name of the indicator agent or agents (Al, A2), if they exist. 

=0 This response notification is processed by the synthesis agent, which extracts the names of 

Ld the indicator agents and activates, for each indicator agent extracted, the subscription 

® procedure described above. 

O In a variant of embodiment, when no indicator agent corresponding to the subdomain 

name/indicator name association exists, the naming service can include means for storing the 

U name of the synthesis agent that sent the notification and the name of the indicator requested. 

Q Thus, as soon as a new indicator agent is installed in the subdomain, it is detected by the 

^ detection means of the naming service (SN) or is declared to the latter by the installer. The 

1^ naming service includes means for performing an update of the subdomain name/indicator 

name associations and for verifying whether the name of the indicator corresponds to a stored 
indicator name. If so, means in the naming service construct, then send, the response 
notification (indicatorLookupResp) to the agents whose names have been previously stored 

25 that have requested the name of the agent responsible for the new indicator detected. 

In another variant, the naming service uses an agent creation tool, such as an agent 
machine, to construct the agent requested. In other words, the naming service constructs an 
indicator agent that evaluates the requested indicator in the specified subdomain. 

According to the invention, the number of resources per subdomain is lower than a 

30 predetermined maximum number. This number is determined as a function of the monitoring 
policy chosen. For example, the maximum number of resources per machine is determined so 
that the cost of calculating the indicators of the synthesis agents is as low as possible, in order 
to reduce the calculation load in the resource or resources supporting the synthesis agents. 


Another possibility consists of determining the maximum number of resources per domain so 
that the number of synthesis nodes is as low as possible, in order to reduce the number of 
resources responsible for monitoring and to concentrate the information representing the 
monitoring. 

5 Before further describing the mechanisms that make it possible to reorganize the 

architecture of the monitoring according to the invention, we will describe, in reference to 
Fig. 3, the method that makes it possible to deploy a monitoring configuration. 

The deployment of a monitoring configuration consists of performing the 
instantiation, i.e. the creation, of the indicator agents for the indicators defined by the hst 

10 {(dl.Il), (di.Ij), (dn.In)}, in which the indicator Ij must be evaluated in the subdomain 
di. To do this, the deployment method uses an agent called a configuration deployment agent 
(ADC). This configuration deployment agent handles the creation of agents called 
configuration agents (AC). Thus, for each resource of a monitored subdomain, the 
configuration deployment agent (ADC) creates a configuration agent (ACa, ACb), which 

15 handles the process of creating the indicator agents (Aj, Bi, Ci) specified by the configuration 
for the resource that has been assigned to it. Thus, the deployment of the various 
configurations is parallelized in each of the subdomains to be monitored, hi essence, the 
configuration deployment agents are created for each subdomain to be monitored in parallel 
fashion. 

2X) The configuration deployment agents (ADC) and the configuration agents (AC) have 

the same characteristics as the indicator agents, i.e. they are also managed by at least one 
agent machine of at least one resource of the domain to be monitored. The location of the 
configuration agents is of little importance; the various configuration agents can reside in the 
same resource or can be deployed in different resources, including even monitored resources. 

25 As explained above, a configuration agent (AC) handles the creation of the indicator 

agents specified by the configuration for a given resource M belonging to at least one of the 
subdomains (di) for which this indicator !„ must be created. To this end, for any indicator !„ 
of the configuration that can be instantiated in the resource M, the configuration agent creates 
an agent called an indicator deployment agent (ADIa, ADIb, Fig. 2) responsible for the 

30 deployment of the indicator or indicators I in the resource M. 

The indicator deployment agent (AD I) can be created in the same resource as the 
configuration agent (AC) or in a different resource. 
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In a variant of embodiment in which the programming language used is the "Java" 
language, the configuration agents (AC) will induce the dynamic loading of the classes 
"I_Deployer" and "I_hidicator", using the mechanisms defined by Java runtime. When these 
classes are not present in the resource in which the configuration process is executed, the 
5 configuration agent receives an exception of the "class not found" type 

"ClassNotFoundException", which activates, at the configuration agent level, means for 
downloading the software elements it requires in the resource, thereby incrementally 
deploying the software elements required for the monitoring, from a minimal kernel. Thus, 
the monitoring method according to the invention provides a solution to the dual problem of 

] 0 configuring a distributed monitoring, i.e., deploying the software configuration and 
configuring the monitoring. 

An indicator deployment agent (ADI) is an agent that determines, for a given type of 
indicator, the various combinations of the values of the variables for which the indicator will 
be instantiated. It therefore handles both the name resolution process (described below) and 

t5 the creation of the indicator agents, as well as their declaration to the naming service (SN). 
The names of the objects referenced by the indicator agent that calculates the 
indicators, as well as the identifications of the agents that calculate the indicators referenced 
during the calculation, are part of the parameters for the creation of any indicator agent 
during its instantiation by the indicator deployment agent. 

20 For any indicator !„, an indicator compiler, after analyzing the equation that defines 

the indicator, generates two object classes "I_Deployer" and "^Indicator", which respectively 
correspond to the indicator deployment agents that deploy the instances of the class 
"I_Indicator" responsible for evaluating the indicator and to the indicator agents that evaluate 
the indicator. The class "I_Deployer" makes it possible to know which indicator agents 

25 identified by the class "I_Indicator" must be created, and makes it possible to declare to the 
naming service (SN) the indicator agents actually created. 

An indicator deployment agent has explicit knowledge of the indicators and the 
identifiers of the objects referenced by the equation. Each of these object identifiers { Idi , . . . , 
Idm} defines a structure wherein certain elements can be variables. 

30 When the management protocol chosen is the protocol SNMP, the indicator 

deployment agent executes the process (described below) for resolving the names of the 
objects referenced in the equation or the formula of the indicator and creates the 
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corresponding indicator agents by determining the valid combinations of the values of the 
variables. 

In essence, the equation that defines the calculation of the value of an indicator refers 
to objects identified by {Idi, Idm} using a possibly empty set of variables {Vj, 
Each object identifier Idj is associated with a set of variables {Wi, ... Wk} belonging to the 
set {Vi, Vn}. Determining the first valid combination of the values of the variables Vi 
consists of applying a process (described below) for searching through the identifiers Idj, for 
example in the order 1 through m, in order to progressively instantiate all the variables { V], 
. . . , Vn } and hence to calculate the identifiers { Idi , . . . , Id^ } . 

When all of the objects identified by {Idj, Idm) in which the variables have been 
replaced with their corresponding values belonging to the combination of values {¥],...,¥„} 
exist, the search process of the indicator deployment agent (ADI) verifies whether or not the 
constraint expressed in the values of the variables is satisfied. The indicator deployment agent 
(ADI) instantiates the indicator agent only when this last constraint is met. The objects 
actually referenced by the equation of the indicator are those identified by {Id,, Idm} in 
which the variables have been replaced by their corresponding values belonging to the 
combination of values {¥],...,¥„}. These objects are passed to the indicator agent as a 
parameters during the creation of the indicator agent. 

In order to find the subsequent valid combinations of the variables Vi, the search 
process of the indicator deployment agent (ADI) searches for the index k for which there 
exists a subsequent element for Idk, for k varying between m and 1. If such a value of k does 
not exist, then the search process is terminated. 

If such a value of k does exist, then the search process is applied to the identifiers Id;, 
in the order k+1 through m for example, in order to progressively instantiate all the variables 
{ Vi, Vn} and hence to calculate the identifiers {Id], Idm}. 

The search process therefore makes it possible to calculate all the objects {Idi, ..Idn} 
of the equation that represent an indicator. 

The search process comprises a process for binding the variables, defined as follows, 
which makes it possible to determine the various variables of the identifiers Idi. Let Id be an 
object identifier in which the variables {Wi, ...Wk} belong to the set {Vj, Vn} mentioned 

above. The identifier Id has the form a,.W,.a2.W2 ak.Wk.ak+i, in which the coefficients aj 

have the structure of an object name in the ASN.l sense of the term, which means that the 
coefficients ai are constituted by a sequence of positive integers. 


Let j be the index for which the variables with successive indices {Wj, Wj} are 
already bound. Finding all of the objects that can be designated by this identifier consists of 
navigating through the instantiated MIB, searching for the objects, for example by launching 
a request of the SNMP "GetNextRequest" type, using as a parameter an object identifier 

whose identifiers begin with the root R = a1.W1.a2.W2 aj.Wj and by applying an object 

identifier unifiability criterion, defined below. Let Id' be the response to the SNMP request 
GetNextRequest applied to the root R, if such an object exists. The object does not exist if the 
response to the request GetNextRequest indicates that the designated object does not exist, in 
which case the search is terminated. 

If Id and Id' are unifiable according to the criterion defined below, then the variables 
{W,, Wk) are all bound, which means that they all have a value, and we have just found 
the identifier of an object that can be designated by Id: Id'. 

In order to find the next valid combination of the variables {Wi, Wk}, knowing 
Id', it is necessary to reiterate the preceding process, using Id' as the starting point for the 
search, after having unbound, i.e., retrieved the previous values of, the variables Wi that were 
not bound at the very beginning of the preceding search process. 

If Id and Id' are not unifiable and the identifier Id' begins with R, then the search 
continues as before, using Id' as the starting point and unbinding any variable Wi that was not 
bound at the very beginning of the preceding search process. 

If Id' does not begin with R, then there is no longer any existing object whose 
identifier could be unified with Id. The search process is then terminated. 

The unifiability criterion is defined as follows. Let Id' be a reference identifier 
(without a variable part) wherein all the elements are integer values; two identifiers Id and Id' 
are unifiable if both identifiers are the same size and if the identifiers Id.ai = Id'.ai for any i 
belonging to [1, k+1]. Let {wj, Wk} be the integer values of the idenfifier Id' that 
correspond to the variables {Wi, Wk} of Id, i.e. that are located at the same position in the 
corresponding identifier. The binding of the variables Wi takes place sequentially for indices 
varying between 1 and k through the following method: 

if Wi is bound, i.e., if Wi already has a value, then this value must be equal to Wi; if 
the latter condition is not satisfied, then Id and Id' are not unifiable and the binding of the 
variables {W3, ...Wk} is not possible; 

if Wi is free, i.e., if Wi does not yet have a value, then the variable Wi will be bound to 
the value Wi. 
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For example, let Id = 1.2.3.Wi.4.5.W2.6.7 and Id' = 1.2.3.10.4.5.20.6.7. These two 
object identifiers are unifiable for variables of W] and W2 whose values are 10 and 20. On the 
other hand, if Id = 1.2.3.Wi.4.5.W,.6.7 and Id' = 1.2.3.10.4.5.20.6.7, then Id and Id' are not 
unifiable because the variable Wi cannot have the values 10 and 20 at the same time. 
5 The identifier unification process makes it possible both to verify that two identifiers 

are unifiable and to determine for which values of the variables that are still free prior to the 
start of the unification process this unification is possible. 

When the name resolution process has exhausted all the valid combinations, the 
indicator deployment agent (ADI) becomes unnecessary if the monitored configuration is 
10 never changed. 

When the software and/or hardware configuration of the monitored machine can 
change, for example, according to the invention, when the predetermined maximum number 
of machines per subdomain has been reached, the indicator deployment agents are reactivated 
in order to re-evaluate the hardware and software configuration by performing a search for 
i5 the instantiable elements, instantiating only those of the indicator agents that don't already 
exist, and deleting those that no longer have any reason to exist. 

In a variant of embodiment, when the programming language is Java, the creation of a 
configuration agent, an indicator deployment agent or an indicator agent consists of sending 
an agent creation request "agentCreateRequest" to the "factory" agent of the agent machine 
20 that manages the subdomain in which the agent must be deployed. This request includes as 
parameters the identification of the agent and the status of the agent. The status of the agent 
corresponds to the serialized Java object of this agent. When the "factory" agent receives this 
request, it deserializes the serialized Java object and thereby obtains the requested agent. 
Annex 1 provides exemplary implementations of various equations that make it 
25 possible to evaluate indicators. 

Fig. 2 represents an exemplary architecture of the links between monitoring agents 
according to the method of the invention. A link is understood to mean an agent's 
subscription to another agent's subscriber list. In this figure, the resources have been 
abstracted. In essence, the chosen installation of the indicator agents and the synthesis agents 
30 depends on the distribution policy, i.e., the criteria of choice for determining which resource 
will be used as the calculation site for this or that indicator. The monitoring of a resource can 
actually be performed, for example, in the resource itself by installing the indicator agent or 
agents in this machine. However, it is possible to perform the monitoring of this same 


resource from another, remote resource in which the indicator agent or agents for this 
resource are installed. In this case, these remote indicator agents perform the measurements 
required for the evaluation of the indicators by querying SNMP-type agents installed in the 
resource to be monitored. Likewise, a resource can be the installation site for a plurality of 
5 synthesis nodes. In fact, the synthesis agents of these synthesis nodes can be managed by just 
one agent machine. Furthermore, this same agent machine can manage a plurality of indicator 
agents. 

According to Fig. 2, the monitored domain comprises two subdomains (dl, d2). Each 
subdomain (dl, d2) comprises a given number of resources (not represented) monitored by a 

10 plurality of agents (Al.l, A1.2, A1.3, Bl.l, B1.2, B1.3). In the exemplary embodiment 
represented in Fig. 2, each agent (Al.l, A1.2, A1.3, Bl.l, B1.2, B1.3) calculates the same 
first indicator (II), but in different resources. The indicator (II) related to the agent (Al.l) is, 
for example, a function requiring the collection of a measurement (mAi.i) on the resource 
associated with the respective agent (Al.l). For example, the indicator (II) is determined by 

15 the formula indicating the number of alarms sent per minute sent a resource. Likewise, each 
indicator (II) related to the other agents (A1.2, A1.3, Bl.l, B1.2, B1.3) is a function requiring 
the collection of the respective measurements (mM.i, mAi.3., mBi.i, mBi.2, mBi.s) in each 
resource associated with each indicator agent (A1.2, A1.3, Bl.l, B 1.2, B1.3). 

Each subdomain (dl, d2) comprises a first synthesis agent (As.i, As.2) responsible for 

2Q evaluating a second indicator (12). The formula of this indicator (12) corresponds, for 

example, to the sum of the first indicators (II) in the entire subdomain associated with the 
first synthesis agent (As.i, As.2). Thus, these two agents make it possible to know the number 
of alarms per minute sent by all of the resources of each subdomain (dl , d2). A second, 
supplementary synthesis agent (As.d) can also be installed and is designed to synthesize the 

25 information from all of the monitored domains. Consequently, this second synthesis agent 
(As.d) is responsible for evaluating a third indicator (13). The formula of this third indicator 
(13) corresponds to the sum of the second indicators (12). This third indicator (13) makes it 
possible to know the number of alarms per minute sent by all of the resources of the 
monitored domain. 

30 During the configuration of the monitored domain represented in Fig. 2, each 

indicator agent (Al.l, A1.2, A1.3, Bl.l, B1.2, B1.3) is first deployed in one or several 
resources, so as to be able to collect the respective measurements (mAi.i, mAi.2, mAi.3., mBi.i, 
iHbi.2, mBi.3) required for the evaluation of the first indicator (II) by each indicator agent 
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(Al.l, A1.2, A1.3, Bl.l, B1.2, B1.3). In a second step, the monitored domain is broken down 
into subdomains (dl, d2), and each agent (Al.l, A1.2, A1.3, Bl.l, B1.2, B1.3) is assigned to 
a subdomain. In this step, the naming service (SN) updates the list (4) of associations, 
because the new indicator agents created are declared by the deployment agents using a 
5 procedure specific to the naming service by creating an association (indicatorName - 
domainName). 

Next, the first synthesis agents are installed in each synthesis node associated with 
each subdomain (dl, d2). During its configuration, each first synthesis agent will subscribe to 
each indicator agent revealed by the notification (IndicatorLookupResp) as a result of the 

10 procedure described above. In the same way, the second synthesis agent (As.d) will subscribe 
to the first synthesis agents (As.i As.a)- 

In operation, the respective indicator agents will evaluate, for example periodically, 
the first indicator (II) in each associated resource and transmit to the first synthesis agent 
(As.i As,2), if necessary, the new value of the first indicator (II). In the same way, the first 

15 synthesis agents (As.] As.2) will evaluate the second indicator (12) with the last values of the 
first indicator (II) received and transmit to the second synthesis agent (As.i As.2), if 
necessary, the new value of the second indicator (12). This second synthesis agent (As.d) also 
evaluates the third indicator (13) with the last values of the second indicator (12) received. 

According to the invention, the monitoring method comprises a step for reorganizing 

W the links between the various agents when the number of resources per subdomain reaches 
the predetermined maximum number. 

Given the architecture of the links between indicator agents and synthesis agents, a 
modification of the monitored domain essentially involves the addition and/or deletion of at 
least one resource in at least one subdomain. In other words, a reorganization consists of 

25 modifying the subscriptions of the synthesis agents in the subscriber tables (1010, 1020, 

2010) as a function of the assignment to this or that subdomain of the additional resource or 
resources. 

The addition of a resource to a subdomain follows the following procedure. 

First of all, new indicator agents are installed and configured in the new resource if 
30 the agents have not been previously installed. In essence, when a resource is moved from one 
subdomain (dl) to another subdomain (d2), there is no new installation of indicator agents, 
since they already exist. 
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Next, the configuration agent sends, to the naming service (SN) of the agent machine 
responsible for the supervision of the subdomain (di) to which the resource must be added, an 
agent addition notification (addToDomain) specifying the name of the subdomain and the 
name of the agent added. In response to this notification, the naming service searches for and 
5 determines all of the agents performing evaluations in the subdomain using the indicator 

evaluated by the new indicator agent. The naming service then sends, to each agent found, an 
object addition notification (addltem) that specifies the name of the agent added. 

Each of the agents thus notified modifies its internal status and subscribes to the new 
indicator agent through a subscription notification (Subscribe), then recalculates its value as 
1 0 soon as possible. 

The deletion of a resource from a subdomain follows the following procedure. 
The naming service (SNi) of the subdomain (di) receives, from the entity that initiated 
~ the deletion of the resource, a deletion notification (removeFromDomain) that specifies the 

name of the domain in question as well as the name of the resource to be deleted. In response 
15 to this notification, the naming service (SNi) searches for and retrieves all of the agents 
performing evaluations in the subdomain using the indicator evaluated by the deleted 
indicator agent. The naming service (SNi) sends an object deletion notification (removeltem) 
r that specifies the name of the deleted resource, thus allowing the agents to modify their 
Z status, to unsubscribe from the indicators, whose references will disappear, and to re-evaluate 
20 their value. 

As soon as this phase is finished, the indicator agent thus disconnected can, for 
example, be included in another domain by means of the procedure for adding a resource. 

It should be clear to those skilled in the art that the present invention allows for 
embodiments in many other specific forms without going beyond the scope of application of 
25 the invention as claimed. Consequently, the present embodiments should be considered as 
examples, but can be modified within the range defined by the scope of the attached claims 


ANNEX 1 


Exemplary implementation of an indicator named ifUtilizationBandWidth that does 
not refer to another indicator. The equation for this indicator is as follows: 

class IfUtilizationBandWidth_hidicator extends Primitivelndicator 
{ 

int delta_l=0; 

public IfUtilizationBandWidth_Indicator( short serverld, 

String machineName, 
int period, 
Oid[] attributes, 
Agentld[] indicators 

) 

throws UnknownHostException, lOException 

{ 

super( serverld, machineName, period, attributes, indicators ); 

} 

public AsnObject evaluate( long deltaT ) 
{ 

int ifInOctets_l= ((AsnUnsLiteger)pairs[0]. value). value; 
int ifOutOctets_l= ((AsnUnsInteger)pairs[l].value).value; 
int ifSpeed_l= ((AsnUnsInteger)pairs[2].value).value; 
int new_delta„l= ifInOctets_l + ifOutOctets_l ; 
int old_delta_l=delta_l; 
delta_ 1 =new_delta_ 1 ; 

int newValue= ( ( ( ( 8 *(new_delta_l-old_delta_l) ) / (int) deltaT ) / 
ifSpeed_l ) * 100 ); 

return( new Asnlnteger( new Value ) ); 
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public class IfUtilizationBandWidth_Deployer extends IndicatorDeployer 
{ _ 

private static String ind_pattems[]={ 
Mib2.ifInOctetst + ".1", 
Mib2.ifOutOctets + ".1", 
Mib2.ifSpeed + ".1" 

}; 

private static IndicatorReference[] ind_indicators=null; 
private static int ind_period= 1 ; 

public IfUtilizationBandWidth_Deployer( short serverld, 

String machineName, 
String[] machines 

) 

throws Exception 
{ super( serverld, machineName, ind_pattems, machines); 
this. serverld = serverld; 
this.machineName=machineName; 
this .period=ind_period; 
indicatorNames=ind_indicators; 

} 

protected Primitivelndicator createListance( short serverld, 

String machineName, 
int period, 
Oid[] attributes, 
Agentld[] indicators 

) 

throws UnknownHostException, lOException 

{ return( new IfUtilizationBandWidth_hidicator( serverld, 

machineName, 
period, attributes, 
indicators)); 

} 


This equation uses instantiated indicators iflnOctets.l, ifOutOctets.l and ifSpeed.l, 
the names of which are stored in a table used by the builder. 

The evaluation of this equation is performed by a generic method called "evaluate". 
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Exemplary implementation of the indicator named ifErrorsRatio when it refers to 
three other indicators, i.e. ifErrors.l, IflnPacketsl and IfOutPackets 1 . The equation for this 
indicator is as follows 

class IfErrorsRatio_Indicator extends Primitivelndicator 
{ 

public IfErrorsRatio_Indicator( short serverld. 

String machineName, 
int period, 
Oid[] attributes, 
Agentld[] indicators 

) 

throws UnknownHostException, lOException 

{ 

super( serverld, machineName, period, attributes, indicators ); 

} 

public AsnObject evaluate( long deltaT ) 
{ 

int IfErrors=((Asnhiteger) indicatorValues[0]). value; 
int IfInpackets=((AsnInteger) indicator Values[l]). value; 
int IfOutpackets=((AsnInteger) indicatorValues[2]).value; 
int newValue= ( IfErrors / ( Ifhipackets + IfOutpackets ) ); 

return( new Asnlnteger( new Value ) ); 

} 

} 


public class IfErrorsRatio_Deployer extends IndicatorDeployer 
{ 

private static String ind_pattems[]=null; 

private static IndicatorReference[] ind_indicators= { 
new IndicatorReference( "IfErrors_Indicator") , 
new IndicatorReference( "Iflnpacketsjndicator") , 
new IndicatorReference( "IfOutpacketsJndicator") 

}; 

private static int ind_period = 100 ; 

public IfErrorsRatio_Deployer( short serverld. 

String machineName, 
String[] machines 

) 

throws Exception 


super(serverld, machineName, ind_pattems, machines); 

this.serverld = serverld; 

thi s . machineName=machineName ; 

period=ind_period; 

indicatorNames=ind_indicators; 

} 

protected Primitivelndicator createlnstance( short serverld, 

String machineName, 
int period, 
Oid[] attributes, 
Agentld[] indicators) 
throws UnknownHostException, lOException 

{ 

return( new IfErrorsRatio_Indicator( serverld, 
machineName, 
period, 
attributes, 
indicators)); 


Exemplary implementation of a new iterator, for example SUM, correspondin 
. The equation for this iterator is as follows: 

public class Sum extends Compoundlndicator 

{ 

public Sum( int period, String[] machines. Class klass ) 
{ 

super( period, machines, klass ); 
} 

public AsnObject evaluate( long delta! ) 
{ 

int length = indicatorValues.length; 
int current = 0; 

for( int i = 0; i < length; i++ ) 
{ 

if( indicatory alues[ i ] != null ) 
{ 

current += ((Asnhiteger) indicatorValues[ i ]).value; 

}; 

}; 

return( new Asnlnteger( current ) ); 
} 

} 
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